A Data-Driven Method for Classifying Connective Phrases

نویسندگان

  • Alistair Knott
  • Chris Mellish
چکیده

This paper describes a three-stage methodology for investigating the semantics and pragmatics of sentence and clause connective phrases. The rst step in the methodology is to assemble a large corpus of connectives. The second step is to organise this corpus into a hierarchical taxonomy of synonyms and hyponyms, using a pre-theoretical substitution test. The nal step is to impose a theoretical interpretation on the taxonomy. The taxonomy lends itself to an analysis of intersentential/interclausal relations in terms of a number of orthogonal binary-valued features; connectives are then seen as signalling the values of one or more of these features. 2

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Enhancing Learning from Imbalanced Classes via Data Preprocessing: A Data-Driven Application in Metabolomics Data Mining

This paper presents a data mining application in metabolomics. It aims at building an enhanced machine learning classifier that can be used for diagnosing cachexia syndrome and identifying its involved biomarkers. To achieve this goal, a data-driven analysis is carried out using a public dataset consisting of 1H-NMR metabolite profile. This dataset suffers from the problem of imbalanced classes...

متن کامل

A Data-driven Method for Crowd Simulation using a Holonification Model

In this paper, we present a data-driven method for crowd simulation with holonification model. With this extra module, the accuracy of simulation will increase and it generates more realistic behaviors of agents. First, we show how to use the concept of holon in crowd simulation and how effective it is. For this reason, we use simple rules for holonification. Using real-world data, we model the...

متن کامل

Identifying Multiple Topics in Texts

In this paper, we present an innovative method for multi-label text classification. Our method uses Lucene to index texts and then assigns one or more classes to a new text based on its similarity relative to an annotated corpus. For finer granularity, we split the text into phrases, and then we focus on the noun phrases. Instead of classifying the entire text, we classify each noun phrase. The...

متن کامل

A New Approach for Scientific Citation Classification Using Cue Phrases

This paper introduces a new method for the rapid development of complex rule bases involving cue phrases for the purpose of classifying text segments. The method is based on Ripple-Down Rules, a knowledge acquisition method that proved very successful in practice for building medical expert systems and does not require a knowledge engineer. We implemented our system KAFTAN and demonstrate the a...

متن کامل

Sentiment Analysis Classification for Rotten Tomatoes Phrases on Kaggle

In the second assignment for CSE 190: Data Mining and Predictive Analytics, we apply some techniques to improve the accuracy of classifying Rotten Tomatoes phrase sentiments. General Terms Algorithms, Experimentation

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996